Clarify print output to specify posterior draws and log-likelihood terms by ishaan-arora-1 · Pull Request #328 · stan-dev/loo

ishaan-arora-1 · 2026-02-21T21:32:02Z

Hey! This addresses #198, where the current print output like

Computed from 4000 by 262 log-likelihood matrix

was flagged as ambiguous — it's not immediately clear which dimension is the number of posterior draws and which is the number of observations/log-likelihood terms. As mentioned in the issue, ParetoSmooth.jl already uses a more explicit format, and this kind of ambiguity can hide bugs (e.g. a user accidentally placing all observations in one multivariate normal).

Changes

The new output format is:

Computed from 4000 posterior draws and 262 log-likelihood terms.

This is close to what @avehtari proposed in the issue comments ("Computed from 4000 posterior draws by 262 log-likelihood terms matrix"), but I dropped "matrix" since the user doesn't really need to know it's a matrix — they care about what the numbers mean.

For log-weight objects (raw psis/tis/sis):

Computed from 1000 posterior draws and 32 log-weight terms.

For subsampled loo:

Computed from 4000 posterior draws and 100 subsampled log-likelihood
terms from 3020 total observations.

Updated methods

print_dims.psis_loo
print_dims.importance_sampling
print_dims.importance_sampling_loo
print_dims.waic
print_dims.psis_loo_ss
print_dims.elpd_generic

Also updated

All corresponding tests in test_print_plot.R and test_loo_subsampling_cases.R
Snapshot files for psis.md, tisis.md, and loo_moment_matching.md
Vignette output in loo2-with-rstan.Rmd and loo2-large-data.Rmd
NEWS.md entry

Fixes #198

The previous print output "Computed from 4000 by 262 log-likelihood matrix" was ambiguous about which dimension was draws and which was observations. This made it easy for users to miss cases where they accidentally misspecified the log-likelihood (e.g. placing all observations in one multivariate normal). The new output explicitly labels both dimensions: "Computed from 4000 posterior draws and 262 log-likelihood terms." Updated all print_dims methods (psis_loo, importance_sampling, importance_sampling_loo, waic, psis_loo_ss, elpd_generic), along with corresponding tests, snapshots, and vignette output. Fixes stan-dev#198

avehtari · 2026-02-23T16:10:17Z

Hi @ishaan-arora-1, can you clarify why did you close this PR? Based on a quick look, it seems to be useful

ishaan-arora-1 · 2026-02-23T16:13:33Z

Hi @ishaan-arora-1, can you clarify why did you close this PR? Based on a quick look, it seems to be useful

Hey, i just wanted to review, why the tests weren't passing before reopening it.
I've reopened the pr now.

I am happy that this pr sounds useful. i'll try to get the tests working in a few hours.

avehtari · 2026-02-23T16:28:52Z

In some of our repos there have been some CI tests failing due to Windows problems. It may take some days before your recent PRs are reviewed. Thanks for submitting them

VisruthSK · 2026-02-23T16:31:08Z

Glancing at the failed logs, I think tests need to be upated to reflect the new documentation.

ishaan-arora-1 · 2026-02-23T16:34:05Z

do you maybe want a hand in that? just lemme know, ill be happy to contribute.

Glancing at the failed logs, I think tests need to be upated to reflect the new documentation.

VisruthSK · 2026-02-23T16:41:27Z

Yes, do you want to take a stab at it?

The tests are expecting "subsampled log-likelihood values" but the new message is "subsampled log-likelihood terms", so the test case should reflect that.

ishaan-arora-1 · 2026-02-23T16:51:47Z

Yes, do you want to take a stab at it?

The tests are expecting "subsampled log-likelihood values" but the new message is "subsampled log-likelihood terms", so the test case should reflect that.

sure, would love to :)

The print_dims.psis_loo_ss method now says "log-likelihood terms" instead of "log-likelihood values", so update the test expectations to match.

ishaan-arora-1 · 2026-02-24T15:08:20Z

Fixed! The tests in test_loo_subsampling_cases.R were checking for "subsampled log-likelihood\nvalues" but the updated print_dims.psis_loo_ss now outputs "terms" instead of "values". Updated all 4 occurrences.

Ran the full test suite locally, 1160 pass, 0 fail, 0 warnings (2 skips are the usual M1 Mac platform skips).

Any feedback @avehtari @VisruthSK

codecov-commenter · 2026-02-24T15:16:46Z

Codecov Report

❌ Patch coverage is 85.71429% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 92.78%. Comparing base (2883f25) to head (09d56b2).
⚠️ Report is 24 commits behind head on master.

Files with missing lines	Patch %	Lines
R/elpd.R	0.00%	3 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #328   +/-   ##
=======================================
  Coverage   92.78%   92.78%           
=======================================
  Files          31       31           
  Lines        2992     2993    +1     
=======================================
+ Hits         2776     2777    +1     
  Misses        216      216

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

VisruthSK · 2026-02-24T15:17:19Z

Lgtm. If Aki likes the language I think this is good to merge.

The only thing I can think of is maybe the tests that you just had to fix should be snapshot tests instead, but that change needn't be made in this PR.

ishaan-arora-1 · 2026-02-24T15:18:09Z

Lgtm. If Aki likes the language I think this is good to merge.

The only thing I can think of is maybe the tests that you just had to fix should be snapshot tests instead, but that change needn't be made in this PR.

sounds good, i can do that in another pr.

ishaan-arora-1 · 2026-02-25T17:09:07Z

@avehtari any comments?

avehtari · 2026-03-06T16:17:37Z

Computed from 4000 posterior draws and 262 log-likelihood terms.

I like this one.

Computed from 4000 posterior draws and 100 subsampled log-likelihood
terms from 3020 total observations.

I don't know right away how this should be said, but subsampling loo uses 1) faster but more biased computation with 4000 posterior draws and 3020 log-likelihood terms and 2) slower but more accurate computation with 4000 posterior draws and 100 subsampled log-likelihood terms

avehtari · 2026-03-18T16:07:03Z

The text for subsampling case in the latest commit is too long. For example, with the example in tests, the text is 225 characters. 1) With loo_subsample() I think that both the fast and slow part use the same number of posterior draws, so that part would not need to be repeated. 2) If the more compact text is still over 80 characters, it would be better to divide in two lines (hopefully three lines is not needed).

Before making any additional commits, make a proposal for new text here in the discussion, and after we have agreed on good new text, then you can make a commit for that. This will save you time, as you don't need to change the tests before we have agreed on the text.

ishaan-arora-1 marked this pull request as draft February 21, 2026 21:34

ishaan-arora-1 marked this pull request as ready for review February 21, 2026 21:36

ishaan-arora-1 closed this Feb 21, 2026

ishaan-arora-1 reopened this Feb 23, 2026

Update tests to match new print output wording

74dfab9

The print_dims.psis_loo_ss method now says "log-likelihood terms" instead of "log-likelihood values", so update the test expectations to match.

VisruthSK requested review from VisruthSK and avehtari February 24, 2026 15:14

VisruthSK approved these changes Feb 24, 2026

View reviewed changes

Update print message for subsampling loo per feedback

09d56b2

VisruthSK self-requested a review March 19, 2026 17:55

Uh oh!

Conversation

ishaan-arora-1 commented Feb 21, 2026

Changes

Updated methods

Also updated

Uh oh!

avehtari commented Feb 23, 2026

Uh oh!

ishaan-arora-1 commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

avehtari commented Feb 23, 2026

Uh oh!

VisruthSK commented Feb 23, 2026

Uh oh!

ishaan-arora-1 commented Feb 23, 2026

Uh oh!

VisruthSK commented Feb 23, 2026

Uh oh!

ishaan-arora-1 commented Feb 23, 2026

Uh oh!

ishaan-arora-1 commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

VisruthSK commented Feb 24, 2026

Uh oh!

ishaan-arora-1 commented Feb 24, 2026

Uh oh!

ishaan-arora-1 commented Feb 25, 2026

Uh oh!

avehtari commented Mar 6, 2026

Uh oh!

avehtari commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ishaan-arora-1 commented Feb 23, 2026 •

edited

Loading

ishaan-arora-1 commented Feb 24, 2026 •

edited

Loading

codecov-commenter commented Feb 24, 2026 •

edited

Loading